Similarity Retrieval Based on SOM-Based R*-Tree
نویسندگان
چکیده
Feature-based similarity retrieval has become an important research issue in multimedia database systems. The features of multimedia data are useful for discriminating between multimedia objects (e g documents, images, video, music score, etc.). For example, images are represented by their color histograms, texture vectors, and shape descriptors, and are usually high-dimensional data. The performance of conventional multidimensional data structures(e g; R-Tree family K-D-B tree, grid file, TV-tree) tends to deteriorate as the number of dimensions of feature vectors increases. The R -tree is the most successful variant of the R-tree. In this paper we propose a SOM-based R -tree as a new indexing method for high-dimensional feature vectors. The SOM-based R -tree combines SOM and R -tree to achieve search performance more scalable to high dimensionalities. Self-Organizing Maps (SOMs) provide mapping from high-dimensional feature vectors onto a two dimensional space. The mapping preserves the topology' of the feature vectors. The map is called a topological of the feature map, and preserves the mutual relationship (similarity) in the feature spaces of input data, clustering mutually similar feature vectors in neighboring nodes. Each node of the topological feature map holds a codebook vector A best-matching-image-list. (BMIL) holds similar images that are closest to each codebook vector In a topological feature map, there are empty nodes in which no image is classified When we build an R -tree, we use codebook vectors of a topological feature map which eliminates the empty nodes that cause unnecessary disk access and degrade retrieval performance. We experimentally compare the retrieval time cost of a SOM-based R-tree with that of an SOM and an R-tree using color feature vectors extracted from 40,000 images. The results show that the SOM-based R-tree outperforms both the SOM and R-tree due to the reduction of the number of nodes required to build R-tree and retrieval time costs.
منابع مشابه
SOM-Based R*-tree for Similarity Retrieval
Feature-based similarity retrieval has become an iniportant research issue in multimedia database systems. The features of multimedia data are useful for discriminating between multimedia objects (e.g., documents, images, video, music score, etc.). For example, images are represented by their color histograms, texture vectors, and shape descriptors. A feature vector is a vector that represents ...
متن کاملPicSOM: self-organizing maps for content-based image retrieval
Digital image libraries are becoming more common and widely used as more visual information is produced at a rapidly growing rate. Content-based image retrieval is an important approach to the problem of processing this increasing amount of data. It is based on automatically extracted features from the content of the images, such as color, texture, shape, and structure. We have started a projec...
متن کاملOptimized Content based Image Retrieval System based on Multiple Feature Fusion AlgorithmL
Recent years have envisaged a sudden increase in the use of multimedia content like images and videos. This increase has created the problem of locating desired digital content from a very large multimedia database. This paper presents an optimized Content Based Image Retrieval (CBIR) system that uses multiple feature fusion and matching to retrieve images from a image database. Three features,...
متن کاملBiomedical CBIR using "bag of keypoints" in a modified inverted index
This paper presents a “bag of keypoints” based medical image retrieval approach to cope with a large variety of visually different instances under the same category or modality. Keypoint similarities in the codebook are computed using a quadratic similarity measure. The codebook is implemented using a topology preserving SOM map which represents images as sparse feature vectors and an inverted ...
متن کاملA combination of Wilcoxon test and R-estimates for document organization and retrieval
The Wilcoxon signed-rank test is exploited for document organization and retrieval in this paper. A novel modeling method for documents and a distance metric between documents are proposed. Both document modeling and document comparisons are based on signed-ranks and are applied to the frequency of occurrence of the document bigrams. A metric using the Wilcoxon signed-rank test exploits these s...
متن کامل